Distributed Artificial Intelligence Models for Knowledge Discovery in Bioinformatics
Abstract
The growing volume of information on biological processes, together with the use of large databases, has significantly increased the accessibility of datasets to the scientific community. This makes it possible to analyze the data in order to extract relevant information or to model and optimize tasks in different processes. In parallel with the growing volumes of information, new or adapted distributed computing models such as grid computing and cloud computing have emerged. Together with new artificial intelligence techniques, and knowledge discovery in particular, these systems make it possible to analyze the information more efficiently and enable the creation of adaptive systems with learning ability.
In the area of distributed artificial intelligence models for knowledge discovery in bioinformatics, ten proposals are presented. These models analyze different biological aspects and simulate processes or user behavior in the health care system. Their main shared characteristic is the use of artificial intelligence techniques to analyze information and extract knowledge.
"Bladder Carcinoma Data with Clinical Risk Factors and Molecular Markers: A Cluster Analysis" presents research on bladder cancer. The paper puts forward the hypothesis that combining clinical and histopathological data with information about markers is useful for managing the treatment of non-muscle-invasive bladder cancer (NMIBC). The authors apply data mining techniques such as hierarchical clustering to create molecular clusters and risk groups. In their experiments, they analyze 45 patients newly diagnosed with NMIBC, create four groups of patients, and categorize the patients according to clinical characteristics and biological behavior.
The authors of "A Linear-RBF Multikernel SVM to Classify Big Text Corpora" apply classifier-based data mining techniques to big text corpora. In particular, they implement a variant of the support vector machine (SVM) to reduce the computational cost. The authors present a multikernel SVM with automatic parameterization that improves on the results of an SVM parameterized by brute-force search. The proposal consists of a workflow with algorithms to process documents, reduce the dimensionality of the data, and perform clustering, training, and prediction. It is evaluated in terms of classification results and model building time on the TREC Genomics 2005 corpus.
In "Analysis of Environmental Stress Factors Using an Artificial Growth System and Plant Fitness Optimization," the authors analyze how some environmental conditions can accelerate the evolution …
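To make the linear-RBF multikernel idea mentioned above more concrete, the following is a minimal sketch of one common way to combine the two kernels: a convex (weighted) combination. The abstract does not state how the paper combines or automatically parameterizes its kernels, so the function names, the `weight` and `gamma` parameters, and the toy vectors below are all illustrative assumptions rather than the authors' method.

```python
import math


def linear_kernel(x, y):
    """Dot product of two equal-length feature vectors."""
    return sum(a * b for a, b in zip(x, y))


def rbf_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel: exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def multikernel(x, y, weight=0.5, gamma=0.5):
    """Convex combination of a linear and an RBF kernel.

    ASSUMPTION: the weighting scheme is illustrative only; the paper's
    actual combination rule is not given in the abstract.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * linear_kernel(x, y) + (1.0 - weight) * rbf_kernel(x, y, gamma)


def gram_matrix(points, weight=0.5, gamma=0.5):
    """Pairwise multikernel values for a list of feature vectors."""
    return [[multikernel(p, q, weight, gamma) for q in points] for p in points]


# Made-up term-count vectors standing in for three documents.
docs = [[1.0, 0.0, 2.0], [0.0, 1.0, 1.0], [1.0, 1.0, 0.0]]
K = gram_matrix(docs, weight=0.7, gamma=0.1)
```

Because a non-negative weighted sum of valid kernels is itself a valid kernel, a Gram matrix such as `K` could in principle be passed to any SVM implementation that accepts precomputed kernels, for example scikit-learn's `SVC(kernel="precomputed")`.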
Similar resources
Computational Intelligence in Bioinformatics
Copyright: © 2014 Nebel JC. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Since the term ‘bioinformatics’ was coined in 1970 [1], the field of bioinformatics has become relatively mature allowing high-throug...
Limbform: a functional ontology-based database of limb regeneration experiments
SUMMARY The ability of certain organisms to completely regenerate lost limbs is a fascinating process, far from solved. Despite the extraordinary published efforts during the past centuries of scientists performing amputations, transplantations and molecular experiments, no mechanistic model exists yet that can completely explain patterning during the limb regeneration process. The lack of a ce...
Knowledge Acquisition from Distributed, Autonomous, Semantically Heterogeneous Data and Knowledge Sources (KADASH)
ion. For example, the program of study of a student in a data source can be specified as Graduate Program (higher level of abstraction), while the program of study of a different student in the same data source (or even a different data source) can be specified as Doctoral Program (lower level of abstraction). 2005 IEEE ICDM Workshop on KADASH 5 The workshop brings together researchers in relevant...
Cluster Based Cross Layer Intelligent Service Discovery for Mobile Ad-Hoc Networks
The ability to discover services in a Mobile Ad hoc Network (MANET) is a major prerequisite. Cluster-based cross-layer intelligent service discovery for MANET (CBISD) combines a cluster-based architecture, caching of semantic details of services, and intelligent forwarding using network layer mechanisms. The cluster-based architecture using semantic knowledge provides scalability and accuracy. Also, the mini...
Soft Computing Methods in Bioinformatics: a Comprehensive Review
Applications of genomics, proteomics, epigenetics, pharmacogenomics, and systems biology have grown considerably, resulting in an explosion in the amount of high-dimensional and complicated data being generated. Bioinformatics data are typically high-dimensional with small sample sizes. Genome-wide investigations generate large amounts of data and there is a need for soft comput...
Journal title:
Volume 2015, Issue
Pages -
Publication date: 2015